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Description 

METHOD AND RELATED APPARATUS FOR 
STEREO VOCAL CANCELLATION 

Background of Invention 
[000 1 ] 1. Field of the Invention 

[0002] The present invention relates to a method and related ap- 
paratus for stereo vocal cancellation, and more particu- 
larly, to a method and related apparatus, which cancels 
vocals of different stereo signals respectively. 

[0003] 2. Description of the Prior Art 

[0004] with advanced information and electronics technology, 
various types of entertainment systems are available. For 
example, karaoke systems can play background music fil- 
tered of vocals, allowing users to sing along and enjoy a 
professional entertainment environment. However, music 
provided at retail outlets generally includes vocals, so in 
order to meet the requirements of an accompaniment 
system vocal cancellation technology is used, which aims 



to attenuate the vocals and leave the background music 
intact. 

[0005] please refer to Fig.l. Fig.l shows a block diagram of a 
player 10 that performs vocal cancellation according to 
the prior art. In general, modern systems can play two- 
channel (or more) stereo sound, with different speaker 
modules of the player outputting the different stereo sig- 
nals to allow users to hear realistic sound. The player 10 
has a sound source circuit 12 to provide two stereo sig- 
nals (such as left and right stereo signals), a signal mod- 
ule 14 to perform vocal cancellation, and two speaker 
modules 16A, 16B to output the stereo sound. The sound 
source circuit 12 can be a CD reading mechanism, which 
utilizes a reading head 18 to access and data of a CD 20 
to demodulate it. The CD 20 has music data of two stereo 
channels, so the sound source circuit 12 reads two stereo 
signals PLi, PRi after accessing the data of CD 20. The sig- 
nal module 14 performs vocal cancellation for the stereo 
signals PLi, PRi to generate two output signals PLo, PRo. 
The speaker modules 16A, 16B have individual A/D con- 
verters, power amplifiers, and speakers to transform the 
output signals PLo, PRo into acoustic waves. 

[0006] | n order to perform vocal cancellation, the signal module 



14 has two high pass modules 26A, 26B, a low pass mod- 
ule 28, and a vocal cancellation module 22. The high pass 
modules 26A, 26B high pass filter the stereo signals PLi, 
PRi to generate two corresponding high pass signals PLh, 
PRh; and the low pass module 28 filters a signal Ps to 
generate a corresponding low pass signal PI. The vocal 
cancellation module generates an intermediate signal PVC 
by the difference between the two stereo signals PLi, PRi. 
The output signal PLo is generated by mixing a sum of the 
high pass signal PLh corresponding to the stereo signal 
PLi, the low pass signal PI, and the intermediate signal 
PVC. The output signal PRo is generated by mixing a sum 
of the high pass signal PRh corresponding to the stereo 
signal PRi, the low pass signal PI, and the intermediate 
signal PVC. 

[0007] jo illustrate vocal cancellation of the mentioned prior art, 
please refer to Fig. 2. Fig. 2 shows the spectrum of each 
stereo signal, with each horizontal axis of the spectrum 
being frequency, and each vertical axis of the spectrum 
being amplitude. 

[0008] Generally speaking, commercially produced music estab- 
lishes stereo sound by mixing different signals of back- 
ground music. The vocal track is mixed into each stereo 



signal with equal intensity. When a user plays the stereo 
signals with the speaker modules, they hear the vocals as 
being ahead because the components of the two stereo 
channels are equal. Different kinds of background music 
in each of the stereo channels makes the user hear the 
stereo effect, as if the background music is around the 
user. In Fig. 2, a spectrum Vf represents the vocal spec- 
trum, and spectrums Lmf, Rmf individually represent the 
different spectrums of background music. As mentioned 
above, the sum of the background music spectrum Lmf 
and the vocal spectrum Vf generates a spectrum of a left 
stereo signal Lf, and the sum of the background music 
spectrum Rmf and the vocal spectrum Vf generates a 
spectrum of a right stereo signal Rf. Like the stereo sig- 
nals PLi, PRi accessed from the sound source circuit 12 in 
Fig.l, their spectrums may be shown as the spectrums Lf, 
Rf. Because of the physical limitation of the human voice, 
which produces vocals not below a specific low frequency 
or exceeding a specific high frequency, the vocal spec- 
trum is usually limited to a specific bandwidth. The fre- 
quencies fl, fh denoted in Fig. 2 individually represent the 
lower bound and the upper bound of the human vocal 
spectrum. The vocal spectrum Vf is concentrated on the 



intermediate-frequency band BM between the frequencies 
fl and fh. In contrast to the vocal spectrum Vf limited to 
the intermediate-frequency band BM, the spectrum of 
background music produced by all kinds of musical in- 
struments is of a broader bandwidth. Shown in Fig. 2, the 
spectrums Lmf, Rmf of background music spread into the 
low-frequency band BL below the frequency fl, and into 
the high-frequency band BH above the frequency fh. In 
addition to the intermediate-frequency band BM that the 
vocal spectrum is located in, each signal spectrum of the 
stereo channel Lf, Rf is also expanded into the low- 
frequency band BL and the high-frequency band BH. 
[0009] Because each stereo signal has the same vocal signals, the 
signal module 14 (please refer to Fig.l) subtracts the 
stereo signal PRi from the stereo signal PLi in the vocal 
cancellation module 22 to remove the common vocals of 
the two stereo signals and generate the intermediate sig- 
nal PVC. The stereo signals PLi, PRi located in the low- 
frequency band BL and the high-frequency band BH are 
reduced in the subtraction process, and the vocal cancel- 
lation should contain the components of background mu- 
sic in the low-frequency band BL and the high-frequency 
band BH. Thus, the signal module 12 uses the high pass 



modules 26A, 26B and the low pass module 28 to perform 
high-frequency compensation and low-frequency com- 
pensation. The high pass module 26A extracts the com- 
ponents of the stereo signal PLi in the high frequency 
band BH to generate the high pass signal PLh. The signal 
source Ps of the low pass module 28 may be one of the 
stereo signals PLi, PRi. The low pass module 28 extracts 
the component of the signal Ps in the low-frequency band 
BL to generate the low pass signal PI. The mixing sum of 
the high pass signal PLh, the low pass signal PI, and the 
intermediate signal PVC can compensate for the high and 
low-frequency components lost in the vocal cancellation 
to generate the output signal PLo. 
[0010] | n the same way, after the high pass module 26B extracts 
the high-frequency components of the stereo signal PRi to 
generate the high pass signal PRh, the signal module 12 
can use the high pass signal PRh and the low pass signal 
PI to perform high and low-frequency compensations for 
the intermediate signal PVC in order to generate the out- 
put signal PRo. In general, each stereo signal in the low- 
frequency band BL does not have direction, so it is diffi- 
cult to build the stereo sound effect by the difference of 
the stereo signals PRi, PLi in the low-frequency band. 



Thus, the signal module 14 uses the same low-frequency 
signal PI to perform low-frequency compensation for the 
output signals PLo, PRo. In contrast, each stereo signal in 
the high-frequency band BH has direction, and the differ- 
ence of the stereo signals in the high-frequency band BH 
allows the user to hear the stereo sound effect. Thus, the 
signal module 14 individually uses the high pass signals 
PRh, PLh, high pass filtered by the two stereo signals PRi, 
PLi, to perform high pass compensation, as well as utiliz- 
ing the difference of the output signals PRo, PLo in the 
high-frequency band to produce the stereo sound effect. 
In summary, the signal module 14 receives two stereo 
signals PLi, PRi, uses the vocal cancellation module 22 to 
generate the intermediate signal PVC as the essential re- 
sult of vocal cancellation, uses the low pass signal PI and 
the high pass signals PLh, PRh as the low and high- 
frequency compensations respectively, and individually 
generates the output signals PLo, PRo as the results of the 
stereo signals PLi, PRi after vocal cancellation. The signal 
module 12 attenuates the vocals of the two stereo signals 
while somewhat preserving the stereo sound effect of the 
background music in the output signals PLo, PRo. 
1 ] Please refer to Fig. 3. Fig. 3 shows the spectrum when the 



signal module 14 of Fig.l operates. In Fig. 3, each hori- 
zontal axis is frequency and the vertical axis is magni- 
tude. Continuing the spectrum example in Fig. 2, if the 
spectrums of the stereo signals PLi, PRi in Fig.l are the 
spectrums Lf, Rf in Fig. 2, after the signal module 14 oper- 
ates the spectrums of the output signals PLo, PRo are as 
the spectrums PLof, PRof in Fig. 3. The frequencies fl, fh, 
the low-frequency band BL, the intermediate-frequency 
band BM, and the high-frequency band BH denoted in 
Fig. 3 are the same as those of Fig. 2. To compare the dif- 
ference of two spectrums PLof and PRof, Fig. 3 illustrates 
the spectrum PRof with a dotted line and the spectrum 
PLof with a solid line. 
[0012] Because the output signals PLo and PRo generated by the 
signal module 14 in Fig.l have the same intermediate sig- 
nal PVC and the same low pass signal PI, the different 
parts are the different high pass signals PLh, PRh for high 
pass compensation. Compared with the spectrums PLof, 
PRof of the output signals PLo, PRo in Fig. 3, the main dif- 
ference is concentrated in the high-frequency band BH, 
the components of the intermediate-frequency band BM 
and the low-frequency band BL of two spectrums PLof, 
PRof being the same. Although the high-frequency com- 



ponents of the signals have the stereo sound effect, most 
of the energy of the spectrums PLof, PRof is concentrated 
in the intermediate-frequency and low-frequency bands 
BM, BL. The signal energy distributed in the high- 
frequency band BH is relatively low, which results in little 
difference between the spectrums PLof, PRof. As such, 
when the player 12 outputs the output signals PLo, PRo, 
the stereo sound effect is not as expected. This is one 
disadvantage of the prior art. In other words, in the prior 
signal module 14 of Fig.l, the output signals PLo, PRo of 
the two stereo channels both use the same intermediate 
signal PVC as the essential signal of vocal cancellation, 
and only use the different high pass signals PLh, PRh as 
the high pass compensation. The difference of the output 
signals PLo, PRo in only concentrated in the high- 
frequency and less-energy portions, and is not significant 
enough to generate an obvious stereo effect. This reduces 

stereo sound effect quality, and lessens user enjoyment. 
Summary of Invention 

[0013] it is therefore a primary objective of the claimed invention 
to provide an improved method and related apparatus of 
vocal cancellation that leaves the stereo signal difference 
intact to generate better stereo sound after vocal cancel- 



lation to solve the above-mentioned problem. 

[0014] | n t he prior art, an intermediate signal generated by two 
stereo signals is the main signal of vocal cancellation. 
With the intermediate signal, the output signals of two 
stereo channels are generated after low-frequency com- 
pensation and separate high-frequency compensations. 
Because the output signals of two stereo channels have 
the same intermediate signal, the difference of them is 
limited only to the high-frequency components, resulting 
in low-quality stereo sound. 

[0015] According to the claimed invention, the average of two 

stereo signals generates a mono signal, and the difference 
between each stereo signal and the mono signal is the 
corresponding intermediate signal of vocal cancellation of 
each stereo signal. The low-frequency compensation and 
the corresponding high-frequency compensation for the 
corresponding intermediate signal of each stereo signal 
generate the corresponding output signal. In this method, 
the corresponding intermediate signal of each stereo sig- 
nal is generated by the difference between the stereo sig- 
nal and the mono signal, so the corresponding intermedi- 
ate signal of each output stereo signal is also different. 
Even after vocal cancellation, the differences of each 



stereo signal in the low-frequency and intermediate-fre- 
quency bands are preserved, which makes the output sig- 
nals have an improved stereo sound effect allowing users 
to better enjoy the accompaniment of stereo sound. 
[0016] These and other objectives of the claimed invention will 
no doubt become obvious to those of ordinary skill in the 
art after reading the following detailed description of the 
preferred embodiment that is illustrated in the various 

figures and drawings. 
Brief Description of Drawings 

[0017] pig.l is a block diagram of a prior art music player 

adapted to vocal cancellation. 
[0018] pig. 2 is a spectrum diagram of each stereo signal of the 

player of Fig.l. 

[0019] Fig. 3 is a spectrum diagram of each output signal of the 
player of Fig.l. 

[0020] Fig. 4 is a block diagram of a player capable of vocal can- 
cellation according to the present invention. 

[0021] Fig. 5 is a spectrum diagram of each output signal of the 
player of Fig. 4. 

[0022] Fig. 6 is a program code to implement the signal module 

of Fig. 4. 
Detailed Description 



[0023] please refer to Fig. 4. Fig. 4 is a block diagram of a player 
30 capable of vocal cancellation according to the present 
invention. The player 30 has a sound source circuit 32, a 
signal module 34, and two speaker modules 36A, 36B to 
play stereo sound. The sound source circuit 32 is a CD 
read mechanism, utilizing a read head 38 to read the sig- 
nal data from a CD 40, as well as processing the signal 
data to output each stereo signal Li, Ri. The signal module 
34 is used to realize vocal cancellation of the present in- 
vention, outputting two output signals Lo, Ro having vo- 
cals cancelled according to two stereo signals Li, Ri. The 
signal module 34 has a mono processing module 50 and a 
low pass module 48. To match two stereo signals Li and 
Ri, the signal module 34 also has two vocal cancellation 
modules 42A, 42B and two high pass modules 46A, 46B. 
The speaker modules 36A, 36B have individual D/A con- 
verters, power amplifiers, and speakers to transform the 
output signal Lo, Ro into acoustic waves. 

[0024] The process of vocal cancellation with the signal module 
34 according to the present invention is described as fol- 
lows. The mono processing module 50 of the signal mod- 
ule 34 can calculate the average of the stereo signals Li, Ri 
to generate a mono signal M, such that M=(Li + Ri)/2. The 



present invention utilizes the mono signal M to perform 
vocal cancellation for each stereo signal. In the vocal can- 
cellation module 42A corresponding to the stereo signal 
Li, the stereo signal Li minus the mono signal M generates 
the mediate signal LVC (LVC=Li-M). In the vocal cancella- 
tion module 42B corresponding to the stereo signal Ri, the 
difference of the stereo signal Ri and the mono signal M 
generates the intermediate signal RVC (RVC=Ri-M). 
[0025] As illustrated in Fig. 2, the vocal signal is usually mixed 

into the stereo signal with equal magnitude. In the present 
invention, the mono signal generated by the average of 
the stereo signals Li, Ri has the same vocals as that of 
each stereo signal. The present invention utilizes each vo- 
cal cancellation module corresponding to each stereo sig- 
nal to subtract the mono signal from each stereo signal, 
individually performing vocal cancellation for each stereo 
signal. The difference from the prior art is that the present 
invention individually performs vocal cancellation accord- 
ing to each stereo signal, so the corresponding intermedi- 
ate signals generated by different stereo signals after vo- 
cal cancellation are different. Referring to the example of 
Fig. 4, the intermediate signal LVC of the stereo signal Li 
after vocal cancellation is equal to Li-M, and the other in- 



termediate signal RVC of the stereo signal Ri after vocal 
cancellation is equal to Ri-M and is different from the in- 
termediate signal LVC. As the discussed above, the stereo 
sound effect is caused by the difference of the stereo sig- 
nals. In the present invention, the signal difference of the 
stereo sound effect generated by the stereo signals Li and 
Ri is preserved in the intermediate signals LVC and RVC. 
The present invention mainly utilizes the signal difference 
of the intermediate signals LVC and RVL to generate im- 
proved vocal-cancelled stereo sound compared to the 
prior art. Note that in the prior technology of vocal can- 
cellation shown as Fig.l, the different stereo signals use 
the same vocal cancellation module to perform vocal can- 
cellation with the same intermediate signal as the essen- 
tial result of vocal cancellation. Compared with the prior 
art, the present invention individually performs vocal can- 
cellation according to the different stereo signals, which 
further preserves the signal difference of each stereo sig- 
nal to better reproduce the stereo effect. 
[0026] As shown in Fig. 4, after generating the intermediate sig- 
nals LVC and RVC according to the stereo signals Li and 
Ri, the signal module 34 can perform high-frequency and 
low-frequency compensation according to the intermedi- 



ate signals LVC and RVC to generate the output signals Lo 
and Ro. The high pass module 46A can extract the portion 
of the stereo signal Li in the high-frequency band (mainly 
the portion higher than the intermediate-frequency vocal 
band, please refer to Fig. 2 and the related description) as 
the high pass signal Lh. The low pass module 48 can ex- 
tract the portion of the signal S in the low-frequency band 
as the low pass signal SI. The signal S can be one of the 
stereo signals Li and Ri, or the mono signal M. Mixing the 
high pass signal Lh corresponding to the stereo signal Li, 
the intermediate signal LVC, and the low pass signal SI 
with the mixing unit 52A is equal to high and low- 
frequency compensation for the intermediate signal LVC, 
the mixing unit 52A also generating the output signal Lo 
corresponding to the stereo signal Li (Lo=LVC+SI + Lh). In 
the same way, the high pass module 46B can extract the 
portion of the stereo signal Ri in the high-frequency band 
as the high pass signal Rh, which allows high-frequency 
compensation for the intermediate signal RVC. Mixing the 
high pass signal Rh corresponding to the stereo signal Ri, 
the intermediate signal RVC, and the low pass signal SI 
with the mixing unit 52B is equal to high and low- 
frequency compensation for the intermediate signal RVC, 



the mixing unit 52B also generating the output signal Ro 
corresponding to the stereo signal Ri (Ro=RVC+SI + Rh). 
[0027] please refer to Fig. 5. Fig. 5 shows the spectrum of the out- 
put signals Lo and Ro generated by the operation of the 
signal module 34 according to the present invention. The 
horizontal axis in Fig. 5 is frequency, and the vertical axis 
is magnitude. To continue the example in Fig. 2, if the 
spectrums of the stereo signals Li, Ri in Fig. 4 are shown 
as the spectrums Lf, Rf in Fig. 2, the spectrums of the out- 
put signals Lo, Ro of the present invention are shown as 
the spectrums Lof, Rof in Fig. 5 (the spectrum Rof is a dot- 
ted line and the spectrum Lof is a solid line, and the fre- 
quencies fl, fh and the frequency bands BL, BM, BH in 
Fig. 2 are also shown in Fig. 5). In Fig. 5, the present inven- 
tion individually performs vocal cancellation according to 
the different stereo signals, so the differences of the 
stereo signals distributed in the low-frequency band BL 
and the intermediate-frequency band BM are also pre- 
served between the output signals Lo and Ro. This makes 
the output signals Lo, Ro of the present invention differ- 
ent not only in the high-frequency band BH but also in the 
low and intermediate-frequency bands. As a result, when 
the player 30 of the present invention outputs the output 



signals Lo, Ro with the speaker modules 36A, 36B, users 
can hear more abundant accompaniment with a more pro- 
nounced stereo sound effect and enjoy an improved ac- 
companiment environment. 
[0028] Each functional block of the signal module 34 of the 

present invention in Fig. 4 can be implemented in the form 
of hardware, firmware, and/or software. For example, 
general players have programmable signal processing cir- 
cuits, and the present invention can be implemented by 
firmware storing program code in the player memory. 
When the signal processing circuit executes such program 
code, vocal cancellation according to the present inven- 
tion is effected. In addition, the present invention can be 
implemented to cancel vocals and offer background music 
in the form of software, such as the computer music play- 
ing programs, which usually cooperate with peripheral ap- 
paratuses (such as sound cards and CD players) to play 
music. Please refer to Fig. 6. A program code 100 is used 
to implement vocal cancellation according to the present 
invention. The array variables x_L and x_R are used to 
represent the different stereo signals Li and Ri (as in 
Fig. 4), and the array variable Mono represents the mono 
signal M. A subroutine HLPass is used to implement the 



high pass modules, while a subroutine Low_Pass is used 
to implement the low pass modules. The array variables 
h_L and h_R individually represent the high pass signals 
Lh and Rh, the array variable low represents the low pass 
signal SI, and the array variables L_out and R_out individ- 
ually represent the output signals Lo and Ro. The integer 
pointer j of program code 100 represents the j-th value of 
a variable array, and it also corresponds to the j-th sam- 
ple of the corresponding signal of the variable array. 
Shown in the program code 100, the mono signal repre- 
sented by the variable Mono is the average of the vari- 
ables x_L and x_R that correspond to each stereo signal, 
the high pass filtered results of the variables x_L and x_R 
are individually stored in the variables h_L and h_R. The 
stereo signal Ri represented by the variable x_R is the sig- 
nal S in Fig. 4, and the low pass signal SI generated by low 
pass filter is represented by the variable low. Finally, the 
intermediate signals LVC and PVC are individually imple- 
mented by an operation x_L[j]-lvlono|j] and x_R[j]-Mono|j] 
of the program code 100. The sum of the intermediate 
signals, the variable of low-frequency compensation low, 
and the variables of high-frequency compensation h_L 
and h_R can generate the output signals of vocal cancella- 



tion according to the present invention, which are individ- 
ually stored in the variables Lout and R_out. 

[0029] | n vocal cancellation of the prior art, because the output 
signals of the different stereo channels use the same in- 
termediate signal as the main result of vocal cancellation, 
the signal difference of the output signals in the low and 
mediate-frequency bands outside the high-frequency 
band is not clear to a listener. Thus, the output signal of 
each stereo channel in the prior art does not produce a 
stereo sound effect of suitable quality. Compared with the 
prior art, the present invention individually performs cor- 
responding vocal cancellation according to the different 
stereo signals, so the output signal of each stereo channel 
preserves the whole signal difference. When the different 
speaker modules output the output signals, an improved 
stereo sound effect can be heard allowing users to better 
enjoy the accompaniment. In addition to typical CD play- 
ers, the present invention can apply to other types of 
players, such as network modules that play music through 
a wired or wireless network. 

[0030] Those skilled in the art will readily observe that numerous 
modifications and alterations of the device may be made 
while retaining the teachings of the invention. Accord- 



ingly, that above disclosure should be construed as lim- 
ited only by the metes and bounds of the appended 
claims. 



